
solve norm layer false negative gap #16642

Open
Gasoonjia wants to merge 4 commits into gh/gasoonjia/102/base from gh/gasoonjia/102/head

Conversation


@Gasoonjia Gasoonjia commented Jan 15, 2026

Stack from ghstack (oldest at bottom):

When comparing AOT intermediate outputs with runtime outputs, we assumed that AOT and runtime should produce the same output for the same operator. But if there are multiple intermediate outputs from a single operator / single operator blob, that assumption may not hold. Dropout, for example, records only the output tensor during AOT, but at runtime we record both the mask and the output tensor.

To support this, in the one-to-many scenario, instead of taking only the last element for comparison, we compare against the runtime output that shares the same size and dtype as the AOT one.
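The matching heuristic described above can be sketched as follows. This is a hypothetical, simplified helper, not the actual ExecuTorch implementation; it works on any object exposing `shape` and `dtype` attributes (as `torch.Tensor` does), with a lightweight stand-in used here for illustration:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TensorMeta:
    """Stand-in for a recorded tensor; real code would use torch.Tensor."""
    shape: tuple
    dtype: str


def find_matching_runtime_output(aot_output, runtime_outputs):
    """Return the first runtime output whose shape and dtype match the AOT one.

    Falls back to the last runtime element (the previous behavior) when no
    candidate matches.
    """
    for candidate in runtime_outputs:
        if candidate.shape == aot_output.shape and candidate.dtype == aot_output.dtype:
            return candidate
    return runtime_outputs[-1]
```

With dropout, the AOT trace records only the float output, while the runtime also records a boolean mask of the same shape; matching on dtype as well as shape selects the output tensor regardless of recording order.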

Differential Revision: D90790256


pytorch-bot bot commented Jan 15, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/16642

Note: Links to docs will display an error until the docs builds have been completed.

❌ 7 New Failures, 3 Cancelled Jobs, 2 Unrelated Failures

As of commit ccb1daf with merge base b46f6b5:

NEW FAILURES - The following jobs have failed:

CANCELLED JOBS - The following jobs were cancelled. Please retry:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 15, 2026
@github-actions

This PR needs a release notes: label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Gasoonjia added a commit that referenced this pull request Jan 16, 2026
Pull Request resolved: #16642

ghstack-source-id: 334110504
@exported-using-ghexport

Differential Revision: [D90790256](https://our.internmc.facebook.com/intern/diff/D90790256/)

@GregoryComer GregoryComer left a comment


I don't fully follow why we can't capture all of the reference outputs for each op to avoid needing to heuristically match, as both portable ops and delegate calls should have matching signatures AOT and at runtime, but I'm assuming we have a good reason. I'll approve to unblock.

Returns:
The matching runtime output, or runtime_intermediate_output[-1] as fallback.
"""
# Find all runtime outputs that match the AOT shape
Member


Is it practical to capture all of the outputs AOT so that we don't have to do this?

Contributor Author


We are capturing all outputs from both AOT and runtime, but the two sets do not always match each other.
Take dropout as an example: AOT generates only the function output, but the runtime generates two outputs, the mask and the real output.
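To make the false negative concrete: assuming the runtime records the dropout output followed by its mask (the ordering here is an illustration, not taken from the source), naively comparing the AOT output against the last recorded element compares it to the mask and fails spuriously, while shape-and-dtype matching selects the right tensor:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RecordedTensor:
    """Illustrative stand-in for a recorded intermediate output."""
    name: str
    shape: tuple
    dtype: str


# Hypothetical recorded intermediates for one dropout node.
aot = RecordedTensor("dropout_out", (2, 4), "float32")
runtime = [
    RecordedTensor("dropout_out", (2, 4), "float32"),
    RecordedTensor("dropout_mask", (2, 4), "bool"),  # extra runtime-only output
]

# Old behavior: take the last element -> picks the mask, dtype mismatch.
naive = runtime[-1]

# New behavior: match on shape and dtype -> picks the real output.
matched = next(
    t for t in runtime if t.shape == aot.shape and t.dtype == aot.dtype
)
```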


Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported meta-exported
